The Super Annotator: A Method of Semi-Automated Rare Event Identification for Large Clinical Data Sets

نویسندگان

  • Patrick R. Alba
  • Olga Patterson
  • Benjamin Viernes
  • Daniel W. Denhalter
  • Nicole Bailey
  • Andrew Wilson
  • Aaron W. C. Kamauu
  • Scott L. DuVall
چکیده

Detecting rare events in an electronic medical record (EMR) is similar to searching for a needle in a haystack. Automated methods (natural language processing) and manual methods (chart review) can be used. However, when events are rare and the patient population is large, manual annotation may not be feasible and NLP alone may not be able to reach acceptable levels of performance. We propose a semi-automated method for NLP-assisted retrieval of relevant documents and manual review of the resulting instances. This method was developed to identify medication-related adverse events among patients receiving care in the U.S. Department of Veterans Affairs. METHODS

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Creation of an Annotated German Broadcast Speech Database for Spoken Document Retrieval

In this paper we present a semi-automatic method for creating annotated data sets from German-language broadcast resources for which audio files as well as transcripts are available on the Internet. The transcripts are required to be reasonably accurate, but not perfect. Our approach is implemented by a integrated bundle of data processing tools, which support the human annotator in the creatio...

متن کامل

Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank

The paper describes an approach to expedite the process of manual annotation of a Hindi dependency treebank which is currently under development. We propose a way by which consistency among a set of manual annotators could be improved. Furthermore, we show that our setup can also prove useful for evaluating when an inexperienced annotator is ready to start participating in the production of the...

متن کامل

Prioritization of Supply Chain Risks in Automotive Industry

Supply chains are constantly exposed to various risks. An incident or uncertain event, which has positive or negative effect on the objectives of a project, is called a risk. According to this identification, analysis and prioritization of risks may have a significant role in the success of the project. The purpose of risk management is to reduce the risks of non-achievement of these object...

متن کامل

Prioritization of Supply Chain Risks in Automotive Industry

Supply chains are constantly exposed to various risks. An incident or uncertain event, which has positive or negative effect on the objectives of a project, is called a risk. According to this identification, analysis and prioritization of risks may have a significant role in the success of the project. The purpose of risk management is to reduce the risks of non-achievement of these object...

متن کامل

I-1: Screening of Subfertile Men for Testicularlar Carcinoma In Situ by An Automated Image Analysis-Based Cytological Test of The Ejaculate

Background: Testicular cancer (TC) is usually diagnosed after manifestation of an overt tumour. Tumour formation is preceded by a pre-invasive and asymptomatic stage, carcinoma in situ (CIS) testis, except for very rare subtypes. The CIS cells are located within seminiferous tubules but can be exfoliated and detected in ejaculates with specific CIS markers. Materials and Methods: We have built ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016